Name | Version | Summary | date |
---|---|---|---|
trl | 0.20.0 | Train transformer language models with reinforcement learning. | 2025-07-29 04:10:06 |
trl-fpo | 0.0.14 | Train transformer language models with reinforcement learning. | 2025-01-18 04:51:57 |
nemo-aligner | 0.6.0 | NeMo-Aligner - a toolkit for model alignment | 2025-01-07 23:05:48 |
shtec-rlhf | 1.0.5 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-06-24 05:55:07 |
hour | day | week | total |
---|---|---|---|
91 | 2271 | 10313 | 304353 |